1 An information-theoretic approach to normal forms for relational and XML data
Marcelo Arenas, Leonid Libkin 

June 2003 Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART 
symposium on Principles of database systems 

Full text available: pdf(309.44 KB) Additional Information: full citation, abstract, references, index terms

Normalization as a way of producing good database designs is a well-understood topic. 
However, the same problem of distinguishing well-designed databases from poorly designed 
ones arises in other data models, in particular, XML. While in the relational world the 
criteria for being well-designed are usually very intuitive and clear to state, they become 
more obscure when one moves to more complex data models. Our goal is to provide a set of 
tools for testing when a condition on a database design, ... 

2 A Web Odyssey: from Codd to XML
Victor Vianu 

May 2001 Proceedings of the twentieth ACM SIGMOD-SIGACT-SIGART symposium on 
Principles of database systems 

Full text available: pdf(282.10 KB) Additional Information: full citation, references, citings, index terms



XRel: a path-based approach to storage and retrieval of XML documents using
relational databases

August 2001 ACM Transactions on Internet Technology (TOIT), Volume 1 issue 1

Full text available: pdf(264.27 KB) Additional Information: full citation, abstract, references, citings, index terms, review

This article describes XRel, a novel approach for storage and retrieval of XML documents 
using relational databases. In this approach, an XML document is decomposed into nodes 
on the basis of its tree structure and stored in relational tables according to the node type, 
with path information from the root to each node. XRel enables us to store XML documents 
using a fixed relational schema without any information about DTDs and also to utilize 
indices such as the B+ 

Keywords: XML query, XPath, text markup, text tagging 
Sara Comai, Ernesto Damiani, Piero Fraternali

October 2001 ACM Transactions on Information Systems (TOIS), Volume 19 issue 4 

Full text available: pdf(707.80 KB) Additional Information: full citation, abstract, references, citings, index terms

The rapid evolution of XML from a mere data exchange format to a universal syntax for 
encoding domain-specific information raises the need for new query languages specifically 
conceived to address the characteristics of XML. Such languages should be able not only to 
extract information from XML documents, but also to apply powerful transformation and 
restructuring operators, based on a well-defined semantics. Moreover, XML queries should 
be natural to write and understand, as nontechnical person ... 

Keywords: Document restructuring, graphical query languages, semantics 



5 Special section on advanced XML data processing: A general technique for querying
XML documents using a relational database system

Jayavel Shanmugasundaram, Eugene Shekita, Jerry Kiernan, Rajasekar Krishnamurthy, 
Efstratios Viglas, Jeffrey Naughton, Igor Tatarinov 
September 2001 ACM SIGMOD Record, Volume 30 Issue 3 

Full text available: pdf(645.23 KB) Additional Information: full citation, abstract, references, citings, index terms

There has been recent interest in using relational database systems to store and query XML 
documents. Each of the techniques proposed in this context works by (a) creating tables for 
the purpose of storing XML documents (also called relational schema generation), (b) 
storing XML documents by shredding them into rows in the created tables, and (c) 
converting queries over XML documents into SQL queries over the created tables. Since 
relational schema generation is a physical database design ... 

6 A normal form for XML documents 
Marcelo Arenas, Leonid Libkin 

March 2004 ACM Transactions on Database Systems (TODS), Volume 29 issue 1

Full text available: pdf(439.02 KB) Additional Information: full citation, abstract, references, index terms

This article takes a first step towards the design and normalization theory for XML 
documents. We show that, like relational databases, XML documents may contain 
redundant information, and may be prone to update anomalies. Furthermore, such 
problems are caused by certain functional dependencies among paths in the document. Our 
goal is to find a way of converting an arbitrary DTD into a well-designed one, that avoids 
these problems. We first introduce the concept of a functional dependency for XM ... 

Keywords: DTDs, XML data, design, functional dependencies, normal form 



7 Document Databases: Bridging XML-schema and relational databases: a system for
generating and manipulating relational databases using valid XML documents
Iraklis Varlamis, Michalis Vazirgiannis 

November 2001 Proceedings of the 2001 ACM Symposium on Document engineering 

Full text available: pdf(130.57 KB) Additional Information: full citation, abstract, references, index terms

Many organizations and enterprises establish distributed working environments, where 
different users need to exchange information based on a common model. XML is widely 
used to facilitate this information exchange. The extensibility of XML allows the creation of 
generic models that integrate data from different sources. For these tasks, several 
applications are used to import and export information in XML format from the data 
repositories. In order to support this process for relational reposito ...

Keywords: XML, document storage and retrieval, mapping, metadata, querying, relational 
databases 



8 On XML integrity constraints in the presence of DTDs 
Wenfei Fan, Leonid Libkin 

May 2002 Journal of the ACM (JACM), Volume 49 issue 3 



The article investigates XML document specifications with DTDs and integrity constraints, 
such as keys and foreign keys. We study the consistency problem of checking whether a 
given specification is meaningful: that is, whether there exists an XML document that both 
conforms to the DTD and satisfies the constraints. We show that DTDs interact with 
constraints in a highly intricate way and as a result, the consistency problem in general is 
undecidable. When it comes to unary keys and foreign keys ... 

Keywords: Consistency, DTDs, XML, implication, integrity constraints 



9 A declarative approach to optimize bulk loading into databases 
Sihem Amer-Yahia, Sophie Cluet 

June 2004 ACM Transactions on Database Systems (TODS), Volume 29 issue 2 



Applications, such as warehouse maintenance, need to load large data volumes regularly. 
The efficiency of loading depends on the resources that are available at the source and at 
the target systems. Our work aims to understand the performance criteria that are involved 
in bulk loading data into a database and to devise tailored optimization strategies. Unlike 
commercial systems and previous research on the same topic, our approach follows the 
fundamental database principle of physical-logical ind ... 

Keywords: Declarative bulk loading, algebra, recovery, side-effects 

10 Data management issues in electronic commerce: The design and performance 
evaluation of alternative XML storage strategies 

Feng Tian, David J. DeWitt, Jianjun Chen, Chun Zhang 
March 2002 ACM SIGMOD Record, Volume 31 issue 1

Full text available: pdf(484.91 KB) Additional Information: full citation, abstract, references, citings

This paper studies five strategies for storing XML documents including one that leaves 
documents in the file system, three that use a relational database system, and one that 
uses an object manager. We implement and evaluate each approach using a number of 
XQuery queries. A number of interesting insights are gained from these experiments and a 
summary of the advantages and disadvantages of the approaches is presented. 

11 Database theory, technology and applications (DTTA): On using collection for
aggregation and association relationships in XML object-relational storage
Eric Pardede, J. Wenny Rahayu, David Taniar 

March 2004 Proceedings of the 2004 ACM symposium on Applied computing 

Full text available: pdf(59.28 KB) Additional Information: full citation, abstract, references, index terms

XML data can be stored in different databases including Object-Relational Database (ORDB). 
Using ORDB, we get the benefit of the relational maturity and the richness of OO modeling.
One modeling concept that can be captured is the collection. Collection structures 
frequently occur in XML documents especially in two relationship types: aggregation and 
association. However, very often when the data is stored in a database repository, the 
collection is flattened. We believe that preserving the coll ... 

Keywords: ORDB, XML, XML schema, collection 



12 TIMBER: A native XML database
H. V. Jagadish, S. Al-Khalifa, A. Chapman, L. V. S. Lakshmanan, A. Nierman, S. Paparizos, J. 
M. Patel, D. Srivastava, N. Wiwatwattana, Y. Wu, C. Yu 

December 2002 The VLDB Journal — The International Journal on Very Large Data 
Bases, Volume 11 Issue 4

Full text available: pdf(268.39 KB) Additional Information: full citation, abstract, index terms

This paper describes the overall design and architecture of the Timber XML database 
system currently being implemented at the University of Michigan. The system is based 
upon a bulk algebra for manipulating trees, and natively stores XML. New access methods 
have been developed to evaluate queries in the XML context, and new cost estimation and 
query optimization techniques have also been developed. We present performance numbers 
to support some of our design decisions. We believe that the key in ... 

Keywords: Algebra, Document management, Hierarchical, Query processing, Semi-structured



13 An analysis of XML database solutions for the management of MPEG-7 media
descriptions

Utz Westermann, Wolfgang Klas 

December 2003 ACM Computing Surveys (CSUR), Volume 35 issue 4

Full text available: pdf(448.76 KB) Additional Information: full citation, abstract, references, index terms

MPEG-7 constitutes a promising standard for the description of multimedia content. It can 
be expected that a lot of applications based on MPEG-7 media descriptions will be set up in 
the near future. Therefore, means for the adequate management of large amounts of 
MPEG-7-compliant media descriptions are certainly desirable. Essentially, MPEG-7 media 
descriptions are XML documents following media description schemes defined with a variant 
of XML Schema. Thus, it is reasonable to investigate curren ... 

Keywords: MPEG-7, XML database systems, multimedia databases 



14 Research sessions: new styles of XML: Colorful XML: one hierarchy isn't enough
H. V. Jagadish, Laks V. S. Lakshmanan, Monica Scannapieco, Divesh Srivastava, Nuwee 
Wiwatwattana 
Heterogeneity arises naturally in virtually all real-world data. This paper presents
evolutionary extensions to a relational database system for supporting three classes of data 